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Abstract 



We hypothesize that the correct power counting for charmonia is in the 
parameter Aqcd/^c, but is not based purely on dimensional analysis (as 
is HQET). This power counting leads to predictions which differ from 
those resulting from the usual velocity power counting rules of NRQCD. 
In particular, we show that while Aqcd/^c power counting preserves the 
empirically verified predictions of spin symmetry in decays, it also leads 
to new predictions which include: A hierarchy between spin singlet and 
triplet octet matrix elements in the J / ip system. A quenching of the net 
polarization in production at large transverse momentum. No end point 
enhancement in radiative decays. We discuss explicit tests which can 
differentiate between the traditional and new theories of NRQCD. 
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I. INTRODUCTION 



Quarkonia have proven to be fruitful in helping us gain a better understanding of QCD. 
For large enough valence quark masses the system should be dominated by Coulomb ex- 
change in the perturbative regime. Fortunately, the physical valence quark masses seem to 
be too small for the states to be truly insensitive to non-perturbative effects, and thus give 
a window on the more interesting aspects of QCD. In order to systematically study these 
effects we need to separate the long distance from the short distance physics. This can be 
accomplished by writing down a proper effective field theory to describe the infra-red. The 
theory should provide a power counting which determines which operators are relevant. In 
most effective theories this power counting is based upon dimensional analysis. However, 
for non-relativistic QCD (NRQCD) this is not the case. Instead, it is an expansion in the 
parameter v, the relative velocity of the valence quarks. This power counting presupposes 
that the states are Coulombic, at least to the extent that a s {mv) ~ v, and leads to the result 
that operators of the same dimension may be of different orders in the power counting. This 
methodology has been applied to the J /if) as well as the T systems. While it seems quite 
reasonable to apply this power counting to the T system, it is not clear, as we will discuss 
in more detail below, that it should apply for the J /if) system. Indeed, we believe that the 
data is hinting towards the possibility that a new power counting is called for in the charmed 
system. 

In Ref. IU, the authors showed how to utilize NRQCD to predict decay rates as well as 
production rates in a systematic double expansion in a s and v. These predictions have met 
with varying degrees of success. For instance, it is possible to explain J / if) and if)' production 
at the Tevatron, though the initial data on the polarization of these states at large transverse 
momentum || seems to be at odds with the NRQCD prediction. In addition, there is an 
unexpected hierarchy of matrix elements in the charmed system which does not seem to 
be there in the bottom system. Furthermore, there is a discrepancy between theoretical 
expectations and data for the end point spectrum of inclusive radiative decays. While we 
do not believe that any one of these pieces of evidence, on their own, is strong enough to 
warrant introduction of a new theory, it seems to us that the evidence, taken as a whole, 
seems to be telling us that the effective field theory which best describes the J /if) system 
may not be the same theory which best describes the T. The purpose of this paper is to 
present an alternative charmonium power counting, first discussed in |§ and later utilized 
to study the quark-antiquark potential [[|, which leads to predictions which seem to have 
better agreement with the data. 

II. BACKGROUND 

A general decay process may be written in factorized form 

T W = Y / C 23+ i Lj (m,a s )(if) | 0^( 2S+1 Lj) | if)). (2.1) 

The matrix element represents the long distance part of the rate and may be thought of 
as the probability of finding the heavy quarks in the relative state n, while the coefficient 
C*2s+i Lj (m, a s ) is a short distance quantity calculable in perturbation theory. The sum over 
operators may be truncated as an expansion in the relative velocity v. 
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Similarly, production cross sections may be written as 

da = J2 da i+J ^ QQ[n]+x (0 | | 0). (2.2) 

n 

Here d<T i+ j_>QQi n ] +x is the short distance cross section for a reaction involving two partons, 
i and j, in the initial state, and two heavy quarks in a final state, labeled by n, plus X. 
This part of the process is calculable in perturbation theory, modulo the possible structure 
functions in the initial state. The production matrix elements, which differ from those used 
in the decay processes, describe the probability of the short distance pair in the state n to 
hadronize, inclusively, into the state of interest. The relative size of the matrix elements in 
the sum are again fixed by the power counting which we will discuss in more detail below. 

The formalism for decays is on the same footing as the operator product expansion 
(OPE) for non-leptonic decays of heavy quarks, while the production formalism assumes 
factorization, which is only proven, and in some applications of production this is not even 
the case, in perturbation theory ||. The trustworthiness of factorization depends upon the 
particular application, as we will discuss further in the body of the paper. We have reviewed 
these results here to emphasize the point that when we test this theory we are really testing 
both the factorization hypothesis as well the validity of the effective theory as applied to 
the J/i/j system. Thus, we must be careful in assigning blame when we find that our theory 
is not agreeing with the data. 

While NRQCD has allowed for successful fits of the data (in particular we have J/ip 
and ip' production at the Tevatron in mind), its predictive power has yet to stand any 
stringent test.g Indeed, one robust prediction of the theory, namely that production at large 
transverse momentum is almost purely polarized [|7|-||, seems to be at odds with the initial 
data.0 Other predictions such as the ratio of X1/X2 in fixed target experiments and the 
photon spectrum in inclusive radiative decays also seem to disagree with the data, as we 
shall discuss in more detail below. We are left with two obvious possibilities: 1) The power 
counting of NRQCD does not apply to the J/ip system. 2) Factorization is violated "badly" , 
meaning that there are large power corrections. The purpose of this paper is to explore the 
first possibility. 

If we assume that NRQCD does not apply to the J/ip system, then we must ask: is there 
another effective theory which does correctly describe the J / ipl One good reason to believe 
that such a theory does exist is that NRQCD, as formulated, does correctly predict the 
ratios of decay amplitudes for exclusive radiative decays. Using spin symmetry the authors 
of [0] made the following predictions: 



One simple test, which has yet to be performed, is to compare the values of the decay and 
production singlet matrix elements, which are predicted to be equal at leading order in w [||. To 
date the production singlet matrix elements have yet to be extracted and compared to the decay 
singlet matrix elements. These extractions can easily done using the direct J/tp production data 
at CLEO §. 

2 The data still has rather large error bars, so we should withhold judgment until the statistics 
improves. 
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r(x c0 -> W + 7) : r(xci -> + 7) : r(x c2 -> J/V + 7) : r(h D -> r? c + 7 ) 

= 0.095 : 0.20 : 0.27 : 0.44 (theory) 

= 0.092 ± 0.041 : 0.24 ± 0.04 : 0.27 ± 0.03 : unmeasured (experiment). (2.3) 

Thus, we would like to find an alternative formulation (power counting) of NRQCD which 
preserves these predictions yet yields different predictions in other relevant processes. Before 
discussing this alternative power counting, we must briefly review the standard formulation. 

The rest of this paper is structured as follows. We will first review the standard power 
counting used for predictions to date. Then we will offer a new power counting and discuss 
how the two theories differ in their treatment of several relevant observables, as well as how 
the theories fair against the data. We close with some remarks regarding the validity of 
factorization in various observables. 



A. NRQCD power counting 



The power counting depends upon the relative size of the four scales (m, mv, mv 2 , Aqcd)- 
If we take m > mv > mv 2 ~ Aqcd, then the bound state dynamics will be dominated by 
exchange of Coulombic gluons with (E ~ mv 2 ,p = mv). This hierarchy has been assumed in 
the NRQCD calculation of production and decay rates and is most probably the reasonable 
choice for the T system, where mv ~ 1.5 GeV. However, whether or not it is correct for the 
J/ip, where mv ~ 700 MeV remains to be seen. 

The power counting can be established in a myriad of different ways. Here we will follow 
the construction of [10], which we now briefly review. There are three relevant gluonic 



modes [11]: the Coulombic (mv 2 ,mv), soft (mv,mv) and ultrasoft (mv 2 ,mv 2 ). The soft 
and Coulombic modes can be integrated out leaving only ultrasoft propagating gluons. In 
the process of integrating out these modes we must remove those large modes from the quark 
field. This is accomplished by rescaling the heavy quark fields by a factor of exp(ip ■ x) and 
labeling them by their three momentum p. The ultrasoft gluon can only change residual 
momenta and not labels on fields. This is analogous to HQET, where the four-velocity 
labels the fields and the non-perturbative gluons only change the residual momenta ||12|| . 
This rescaling must also be done for soft gluon fields [|IB|] which, while they cannot show up 
in external states, do show up in the Lagrangian.^ After this rescaling a matching calculation 
leads to the following tree level Lagrangian []TD| 



<?,<? p.p 



Q 



A A 

Si ql , Jig 



g^(q' -p + p'Y -g»°(q-p + p') u + g^(q-q'f /t 

+ — To. 



vP - P) 



A u A^ 



+4> ^ x, r ^ f + E ^"^ V^^VpX-q^Vp + • • • 



p,q 



(p - q)' 



(2.4) 



3 Thus the nomenclature is slightly misleading since we have not removed these fields from the 
Lagrangian. 



where we have retained the lowest order terms in each sector of the theory. The matrices 
T A and T A are the color matrices for the 3 and 3 representations, respectively. Notice that 
the kinetic piece of the quark Lagrangian is just described by a label. This is a result of the 
dipole expansion which is used to get a homogeneous power counting. The last term 
is the Coulomb potential, which is leading order and must be resummed in the four-quark 
sector, while the other non-local interactions arise from soft gluon scattering. 

Now all the operators in the Lagrangian have a definite scaling in v. The spin symmetry, 
which will play such a crucial role in the polarization predictions, is manifest. The two 
subleading interactions which will dominate our discussion are the "electric dipole" (El) 

Cei = • M P , (2.5) 

and "magnetic dipole" (Ml) 

<r ■ B 

Cmx = Cir# P — — Vp- ( 2 - 6 ) 

The El interaction is down by a factor of v while the Ml is down by a factor of v 2 . The extra 
factor of v stems from the fact that the magnetic gluons are ultrasoft,^ and the derivative 
operator therefore picks up a factor of v 2 . These operators play a crucial role in the so-called 
octet mechanism. 



B. New Power Counting 

Let us now consider the alternate hierarchy m > mv ~ Aqcd- One might be tempted 
to believe that in this case the power counting should be along the lines of HQET, where 
the typical energy and momentum exchanged between the heavy quarks is of order Aqcd- 
However, this leads to an effective theory which does not correctly reproduce the infra-red 
physics. With this power counting, the leading order Lagrangian would simply be 

£hqet = V4A)VV (2.7) 

where the fields are now labeled by their four velocity. This is a just a theory of time- 
like Wilson lines (static quarks) which does not produce any bound state dynamics. Thus 
we are forced to the conclusion that the typical momentum is of order Aqcd, whereas the 
typical energy is AQ CD /m. The dynamical gluons are now all of the type (Aqcd, Aqcd), as 
the on-shell ultrasoft modes get cut-off by the confinement scale. Therefore, one no longer 
labels fields by their three velocities. The only label is the four velocity of the heavy quark. 
However, the D 2 / (2m) is still relevant and their is no dipole expansion. We can not resist 



4 One may wonder why the emission of a soft gluon cannot lead to the enhancement of the magnetic 
transition operator. However, the emission of such a gluon leaves the quark off-shell and contributes 
a pure counter-term to the matching |fj]. 
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the temptation of introducing yet another acronym,f] and call this theory NRQCD C , while we 
will refer to the traditional power counting as NRQCD?, as we assume that it does describe 
the bottom system.^ 

The power counting of this theory is now along the lines of HQET where the expansion 
parameter is Aqcd/^q- However the residual energy of the quarks is order AQ CD /mg, while 
the residual three momentum is Aqcd- Thus one must be careful in the power counting to 
differentiate between time and spatial derivatives acting on the quark fields. As far as the 
phenomenology is concerned, perhaps the most important distinction between the power 
counting in NRQCD C and NRQCD?, is that the magnetic and electric gluon transitions are 
now of the same order in NRQCD C . This difference in scaling does not disturb the successes 
of the standard NRQCD?, formulation but does seem help in some of its shortcomings. 



III. LIFETIMES 

In the case of inclusive decays the use of effective field theory put theoretical calculations 
on surer footing. Previous to the advent of NRQCD, inclusive decays were written as a 
product of a short distance decay amplitude and a long distance wave function which was 
usually taken from potential models |T7[] 

r JA H V>(0) \ 2 C(m,a s ). (3.1) 

Most of the time this formalism is adequate, however there is the question of the scheme 
dependence of the potential wave function beyond leading order. Beyond this drawback is 
the question of how to factor infra-red divergences in P wave decays. Within the effective 
field theory approach, however, these issues are clarified. The rate is now written as 

r W = E^ +li; (m,a s )(^ | O il)8) ( 2S+1 Lj) | V). (3.2) 

The operator matrix element gives the probability to find the quarks within the hadron in 
the state 2S+l Lj. The quarks can be either in a relative singlet or octet state, hence the 
subscript (1,8). The matrix elements are well defined scheme dependent quantities, which 
can be measured on the lattice, or extracted from the data [|T^], and have definite scalings in 
v. For instance, consider the operator (xj \ Oi( 3 Pj) \ xj)- We would expect this operator 
to dominate the decay of x states, given that the quantum number of the short distance 



quark pair match the quantum number of the final state. However, this is not the case [|19 
The operator (xj \ 0$( 3 Si) \ xj) is of the same order. This can be seen from the fact that 



We stole this bit of prose from [15|. 

6 In the language of |L6]], NRQCD?, would correspond to pNRQCD and NRQCD C would correspond 
to NRQCD. We chose to introduce these new acronyms because calling NRQCD C NRQCD would 
be misleading, since the original NRQCD, as defined in is indeed distinct from NRQCD C . We 
thus believe that our labeling will be the simplest for our purposes and hope the community will 
indulge us in our, what may be perceived as gratuitous, acronymization. 



6 



the P wave operator comes with two spatial derivatives. The octet S operator vanishes at 
leading order, since there is no S wave component in the leading order hadronic state in 
the effective theory. The first non-vanishing contribution comes from two insertion of El 
operators into time ordered products with 8 ( 3 5i). Thus both the singlet P wave operator 
and the octet S wave operator scale as v 2 . Furthermore, the inclusion of this operator into 
the rate allows for the proper absorption of infrared divergences in the P wave decays into 
octet S wave matrix elements. This should be considered a formal success of the effective 
field theory. Any change in the power counting will not change this success, as the scaling 
of an operator is independent of its renormalization group properties. Such a change could 
only effect the relevance of the infra-red divergence, in a technical sense. 

The advent of NRQCD had little impact on the phenomenology of inclusive decays 
because it simply justified previous calculations of the total width. However, one novel 
prediction of NRQCD was found in the end point spectrum of inclusive radiative decays 
PUPTfl . Radiative decays, as opposed to hadronic decays,[] have the advantage that they 
are subject to an operator product expansion, thus rely less upon local-duality assumptions. 
The integration over the photon energy smears through resonances and thus one may expect 
the prediction to be more trustworthy. We may reliably calculate the photon spectrum itself 
if we smear over regions of phase space which are larger than Aqcd P9 ■ 

In NRQCD, the decay of the J/ip is dominated by the 3 Si singlet operator with the 
octet operators being suppressed by v 4 n/a s . However, due to the singular nature of the 
octet Wilson coefficients (it is a delta function at leading order in a a ) they can become 
leading order near the end point of the spectrum. Of course we do not expect a delta 
function spike at the end point since the spike should be smeared out due to bound state 



dynamics, among other effects p2[ , which can be taken into account by the introduction of 
structure functions. 

In [^] it was shown that if one smeared the photon spectrum over a range of order 
mv 2 , then the spectrum would receive a leading order corrections from the octet matrix 
elements which are peaked at the end point. In the standard hierarchy such a smearing 
is satisfactory since it corresponds to smearing over mv in hadronic mass which is larger 
than Aqcd- However, in the new power counting this is no longer true and, given that the 
OPE breaks down in the region where the octet was suppose to dominate, it is no longer 
true that we can predict any peak with reliability. If we now consider the data, we see that 
for the J/ip the data is monotonically decreasing [p3| . On the other hand the T data does 



show a bump out at larger values of the photon energy. We wish to take this as support for 
the new power counting in the J/ip system. But we must be careful since there are other 
effects which become important at the end point which we have not taken into account. 
For instance, near the end point there are large radiative corrections which are known to 
resum into a Sudakov suppression. However, we would not expect this effect to completely 
eliminate the bump, just cut it off at larger energies. Nonetheless a complete calculation of 
the resummed Sudakov effects in the end point spectrum of T decays is needed. 



We are ignoring photon fragmentation for the moment. 
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IV. HADRO-PRODUCTION 



As discussed in section II a general production process may be written in the factorized 
form ( |2.2| ). The long distance part of the process involves the hadronization of the heavy 
quarks in the state n into the hadron of choice H . The matrix element in Eq. fl2.2|) is written 

as 

(0 | Of | 0) = (0 I ^T n ' x \ Y.H + X){H + X\ x tFV | 0) (4.1) 

x 

= W). 

The tensor T n operates in color as well as spin space and also contains possible derivatives. 
This tensor determines the order of the matrix element. If the quantum numbers n do 
not match the quantum numbers of the hadron, then the matrix element vanishes, as the 
hadronic states are those of the effective field theory, and are pure in the sense of a Fock 
space expansion. To get a non-vanishing result one must insert subleading operators into a 
time ordered product with the operator Of. The number and order of the inserted operators 
determine the scaling of the matrix element, as we detail below. 



A. Collider experiments 

The leading order contribution to ip production in the original v power counting scheme 
is through the color-singlet matrix element (Of ( 3 Si)), since the quantum numbers of the 
short distance quark pair matches those of the final state. All other matrix elements need the 
insertion of operators into time ordered products to give a non-zero result. Unlike the case 
of the x discussed earlier, all other matrix elements are suppressed compared to the color- 
singlet matrix element above. For instance, the matrix element (Og ( 1 5'o)) vanishes at leading 
order. The first non-vanishing contribution comes from the insertion of two Ml operators 
into time ordered products, thus giving a v 4 suppression. The scalings of the relevant matrix 
elements for ip production in NRQCDf, are shown in Table |. It appears from just the v 
counting that only the color-singlet contribution is important. But the other contributions 
can be enhanced for kinematic reasons. At large transverse momentum, fragmentation type 
production dominates pi] , and only the (Og ( 3 5i)) contribution is important. Without the 
color-octet contributions (i.e., the Color-Singlet Model), the theory is below experiment by 
about a factor of 30. By adding the color-octet contribution the fit to the data is very good 



25| . The new power counting must also reproduce this success. 

The relative size of the different matrix elements change in NRQCD C . In particular, the 
Ml transition is now the same order as the El transition. The new scalings are shown in 
Table | Due to the dominance of fragmentation at large transverse momentum, we 



need to include effects up to order (AQ CD /m c ) 4 , since the (Og ( 3 Si)) matrix element will still 
dominate at large pr- 

Is this consistent? The size of the matrix elements is a clue. Extraction of the matrix 
elements uses power counting to limit the number of channels to include in the fits. Cal- 
culating J/tp and ip' production up to order (Aq CD /?7i c ) 4 in NRQCD C requires keeping the 
same matrix elements as in NRQCD&. Previous extractions of the matrix elements only 
involve the linear combination 



S 





(Of( 3 Si)) 


<0?( 3 Si)> 


(oteso)) 


<^( 3 P )) 


NRQCD fe 


v° 








NRQCD C 


(A QCD /m c )° 


(AqcdM c ) 4 


(Aq C d/™ c ) 2 


(AqcdMc) 4 



TABLE I. Scaling of matrix elements relevant for tp production in NRQCD;, and NRQCD C . 

Mi = (Ot^Sn)) + —AOtfPj)), (4.2) 



with r m 3 — 3.5, since the short-distance rates have similar size and shape. In the new 
power-counting, we can just drop the contribution from (0 8 ( 3 Pj)), since it is down by 
(A-qcdA^c) 2 ~ 1/10 compared to (Of ( 1 5'o)). It is the same order as (0 8 ( 3 Si)), but is not 
kinematically enhanced by fragmentation effects. The extraction from |27j would then give 
for the J I ip and ip' matrix elements 



(O J a '*?S )) : (O^St)) 



(Ot'( 1 S )):(O!(- i S 1 )) 



(6.6 ± 0.7) x KT 2 : (3.9 ± 0.7) x 10~ 3 
(7.8 ± 3.6) x 10~ 3 : (3.7 ± 0.9) x 10~ 3 



17 : 1 
2 : 1. 



(4.3) 



Other extractions have various values of the hierarchy, ranging from 3 : 1 to 20 : 1 |28| . While 
the relation of the color-octet matrix elements in the J/tp system is indeed in agreement 
with the NRQCD C power counting, the ip' does not look to be hierarchical. However, it 
should be noted that the statistical errors in the ip' extraction, quoted above, are quite 
large. Furthermore, there are also large uncertainties introduced in the parton distribution 
function. The above ratios used the CTEQ5L parton distribution functions. If we take the 
central values from [fZT]] for the MRST98LO distribution functions, we find the ratio 3:1. 
On the other hand, the J/ip extraction is much less sensitive to the choice of distribution 
function. Given the statistical and theoretical errors, it clear that the ij}' ratio is not terribly 
illuminating. 

Let us now consider the extraction of these color-octet matrix elements in the T sec- 



tor ||29|| , where according to NRQCD5 power counting there is should be no hierarchy: 

~ 2 : (3.6d= l-9±i;i) x 10~ 2 



(0 8 T{35) (%)> 

<o 8 T{25) (%)> 
<o 8 T{15) (%)> 



(o 8 T(35) ( 3 ^)) 

(0 8 T(25) ( 3 5 l} ) 

(o 8 T(15) ( 3 ^)) 



+3.1\ 
2) 



(5.4 ± 4.3I£ 
(-10.8 ±9.7 
(13.6 ±6.8 



x 10" 



1:1, (4.4) 



-3.4\ 
2.0) 



x 10~ 2 : (16.4 ± 5.7±H) x 10" 2 « 1 : 1, 



™) x 10- 



: (2.0 ±4.1 



-0.6-v 
1-0.5J 



x 10" 



6 : 1. 



For the T(3S) and T(2S) we observe that there is indeed no hierarchy, while for the T(1S) 
it appears like there may be a hierarchy ||29|| . However, it is not possible to draw any strong 
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(oK 3 Pj)} 




(O 8 x (^o)) 


(o 8 x ( 3 Pj)> 




NRQCD 6 


v 2 


v 2 




V 6 


V 6 


NRQCD C 


(A QCD /m c ) 2 


(Aqcd/"I c ) 2 


(Aqcd/«i- c ) 4 


(A QCD /m c f 


(Aqcd/«i c ) 4 



TABLE II. Scaling of matrix elements relevant for \ production in NRQCDj, and NRQCD, 



conclusions from these data because the errors on the extractions are large. In fact the ratio 
for the T(IS') color-octet matrix elements is 1 : 1 within the one sigma errors. Further- 
more, these matrix elements are those extracted subtracting out the feed down from the 
higher states. Thus, the extraction of the T(IS') matrix elements actually have larger errors 
since the errors accumulate when we make the subtractions of the feed down components. 
Finally, we should add that, while phenomeno logically it is perfectly reasonable to define 
the subtracted matrix elements, we believe that, since the matrix elements are inclusive, 
one should not subtract out the feed down from hadronic decays when checking the power 
counting. In principle this subtraction should not change things by orders of magnitude, 
but nonetheless can have a significant effect. Indeed, if one compares the ratios for inclusive 
matrix elements, which do not have the accumulated error, then the ratios come out to be 
1 : 1, even for the T{IS) [f^Q 

B. Fixed-target experiments 

There are several phenomenological differences between NRQCD C and NRQCD;, in fixed 
target experiments pT| , |32fl . Here we will focus on ip production and the predicted ratio of 
production cross sections <y(xi)/ cr (X2) in NRQCD C . 

At order a 2 , ips are produced via quark- ant iquark fusion through the (0$ { 3 Si)) matrix 
element and through gluon fusion through the (Og ( 1 S , )) and (Og ( 3 P )) matrix elements, in 
the linear combination Mf . At fixed-target energies, the contribution to ip production from 
(Og ( 3 Si)) is numerically irrelevant because gluon fusion dominates. The difference between 
the NRQCD C prediction and the NRQCD;, analysis done in []3l| lies in the expected size of 
the matrix elements in Mf (called A 8 (■?/>) in JHj]), since in NRQCD C the 3 Pq matrix element 



8 The authors of |30| have values of the extracted matrix elements that differ from p9| . They use 
PYTHIA to model initial state gluon radiation, and use data at small values of pr in the extraction. 
Since we are worried about breakdown of factorization, we prefer to restrict our analysis to data 
points where factorization should hold. 
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is down by (Aqcd / 'fn c ) 2 ■ However, since the 3 Po matrix element is enhanced by a factor of 7, 
it is important to keep this formally subleading contribution. Furthermore, there is a very 
large scale and PDF dependence in these extractions, so it is not clear whether or not we 
can learn anything from comparisons with the Tevatron extractions. 

The X1/X2 production ratio has the nice property that it is relatively insensitive to the 
charmed quark mass |3T|. xi production is suppressed as it can not be produced at leading 
order in the singlet channel. The formally leading order xi channel is 3 S^ through quark- 
antiquark fusion. In both NRQCD?, and NRQCD C this formally leading order contribution 
is actually smaller than the subleading contributions coming from other octet operators 
((Of ( 3 P )) and (Of ( 3 P 2 )) in NRQCD fe , and (Of (%)) in NRQCD C ) due to the fact that 
these other channels are initiated by gluon-gluon fusion. The scalings for the x matrix 
elements are shown in Table [T1]. If we ignore the quark initiated process then due to simplicity 
of the 2 — > 1 kinematics we may write the NRQCD& prediction as 

/ „ 75 (Of (%)K + 3(Of ( 3 Pq)) + 40f ( 3 P 2 )) 
<Wx2- 32 (Of( 3 P 2 )> ' 1 ' 

where numerically small contributions have been dropped. This ratio is approximately 1/3 
if we take take v 2 ~ 0.3. Also, the ratio is independent of the center of mass energy, which 



agrees with the data within errors [[33 



In NRQCD C we have (again neglecting the numerically small quark-antiquark initiated 
processes) 

where, once again, numerically small contributions have been dropped. If we take 
Aqcd/^c ~ 1/3 we get approximately the same result. This estimate is so crude that 
it is not clear whether any information can be gleaned from it. However, it does seem that 
in either description the data [34| is, on the average, larger than these naive predictions.^ 



This could very well be due to large non-factorizable contributions, which we may expect to 
be enhanced in NRQCD C (see the conclusions). 

V. POLARIZATION 

J ftp and ip' are predicted to be transversely polarized at large pt in NRQCD fe . At large 
transverse momentum, the dominant production mechanism is through fragmentation from 
a nearly on shell gluon to the octet 3 Si state. The quark pair inherits the polarization of 
the fragmenting gluon, and is thus transversely polarized [?). In NRQCDj, the leading order 
transition to the final state goes via two El, spin preserving, gluon emissions. Higher order 



perturbative fragmentation contributions M, fusion diagrams [35, 3q], and feed-down for the 



9 One robust prediction, however is that the ratio should be independent of s, which does seem to 
agree with the data. 
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J I if) |27] dilute the polarization some, but the prediction still holds that as px increases so 
should the transverse polarization. Indeed, for the if>', at large pt 3> m c , we expect nearly 
pure transverse polarization. 

The polarization of J /if) and if)' at the Tevatron has recently been measured with 
large error bars. The experimental results show no or a slight longitudinal polarization, 
as pt increases. If, after the statistics improve, this trend continues, then it will be the 
smoking gun that leads us to conclude that NRQCD& is not the correct effective field theory 
for charmonia. 

With NRQCD C , the intermediate color-octet 3 Si states hadronize through the emission 
of either two El or Ml dipole gluons, at the same order in l/m c . Since the magnetic gluons 
do not preserve spin, the polarization of if) produced through the (0$ ( 3 Si)) can be greatly 
diluted. The net polarization will depend on the ratio of matrix elements 

Rm/e— 



IUed A x e (0\ 


T(M 1 ( 


x 1 )M 1 {x 2 )^T a (T i x) 4 a H T{M 1 {x 3 )M 1 {x 4 ) X ^T a a i if)) 


10) 


JIU<Px t (0 


\T{E 1 


{x 1 )E 1 {x 2 )if)^a iX ) 4 a H T(E 1 (x 3 )E 1 (x 4 ) X ^a i iP) \ 


0) 



(5.1) 



where 

a j H a H = J2\ H + X}(H + X \ . (5.2) 

x 

The leads to the polarization leveling off at large pt at some value which is fixed by Rm/e- 
In Fig. H], we show the prediction for J /if) and if)' polarization at the Tevatron. The data is 
from |2j. The three lines correspond to different values for R M / E =(0 (dashed), 1 (dotted), 
oo (solid)). The dashed line is also the prediction for NRQCD fe . The residual transverse 
polarization for J /if) at asymptotically large px is due to feed down from X states. The 
non-perturbative corrections to our predictions are suppressed by AQ CD /m 4 . 



VI. B DECAYS 



Another useful observable for differentiating between NRQCD C and NRQCD& is char- 
monia production in B decays. Assuming perturbative factorization, the if) production rate 
from semi-inclusive B decays may be written as 



V(B -> H + X) = C(b -> cc[n] + x) (O*). 



(6.1) 



This expression is valid up to power corrections of order Aqcd/^^c, which parameterize the 
non-factorizable contributions. (To this accuracy it is justified to treat the B meson as a 
free b quark.) The short distance coefficients are determined by the AB = 1 effective weak 
Hamiltonian 



G 



-WEWiW ( 6 - 2 ) 

i=3 J 



containing the 'current-current' operators 



12 



0.5 



-0.5 

-1 
1 





0.5 

















-0.5 




-1 



10 



15 



20 



10 



15 



20 



P T (GeV) 



FIG. 1. Predicted polarization in NRQCD C for J /tp and tp' at the Tevatron as a function of px- 
The three lines correspond to R M / E =(0 (dashed), 1 (dotted), oo (solid)). The dashed line is also 
the prediction for NRQCDj. 



= [c 7m (1 - 7s)c] [67^(1 - 75 )g], (6.3) 
O s = [51^(1 - 7s)c] [6TV(1 - Tte)?], (6-4) 

and the QCD penguin operators 03-6 (precise definitions may be found in the review [ft?!). 
For the decays B — > charmonium +X it is convenient to choose a Fierz version of the current- 
current operators such that the cc pair at the weak decay vertex is either in a color-singlet 
or a color-octet state. The coefficient functions are related to the usual C± by 

C [1] Qi) = 2C + Qi)-C-Qj.), (6.5) 
C [b] (ji) = C + (jj,) + C-(ji,). (6.6) 

In NRQCDf, a naive power counting leads to the conclusion that the leading order result 
is fixed by the (0i( 3 Si)) operator as all octet operators are suppressed by t> 4 . However, as 
pointed out in Ref. f38fl , the fact that the Wilson coefficients evaluated at the low energy 
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scale are numerically hierarchical, Cf/Cf ~ 15, actually leads to octet domination. In 
NRQCDf, one would then get leading order contributions from all the octet matrix elements, 
(Og / ^( 3 5 1 )), (O^fSo)) and (Og^fPo)), where the contribution from the other 3 Pj states 
have been written in terms of the 3 Pq contribution using spin symmetry. 

In NRQCD C , the leading order octet contribution comes solely from the 1 5'o operator 
which is suppressed by Aq CD /7?i 2 . Thus, we may get a direct extraction of this matrix 
element from the decay rate. However, at leading order in a s the color-singlet contribution 
is highly scale dependent. This is due to large scale dependence in the value of C\(p). The 
authors of [^T|] found that the leading order singlet contribution varies by a factor of ten 
as /i is varied between 2.5 and 10 GeV. This scale dependence can be drastically reduced 
by working at next-to-leading order (NLO) and using a combined expansion in a s and the 
ratio Ci/Cs |39|]. Using this expansion a NLO order calculation found that within NRQCD& 
power counting one could extract the linear combination [4^ 



M i> /-io(B) 3 P (8)x _ / 1.5 ■ 10 2 GeV 3 (J/VO , fi 7 s 

M 3 A \ , Kj ) ~ j q . 10 -2 GeV 3 (^). (P- 7 ) 

In NRQCD C the result is all spin singlet and we would thus conclude that (O^^Sq)} = 
1.5 • 10~ 2 , which is quite a bit smaller than the Tevatron extraction . 

We may also consider how the new power counting effects the prediction for the polar- 
ization of the ip. In NRQCDf, the prediction for the polarization, at leading order in a s , was 
given in [Q. The angular distribution in the leptonic J /if? decay may be written as 

(V> -> fM + fJ.~){0) oc 1 + a cos 2 9, (6.8) 



dcos9 



where the angle 9 is defined in the J/ip rest frame for which the z-axis is aligned with 
direction of motion of the J/tp and 

„(+)+„(-) -2,(0) 

Within NRQCD& the authors found that a lies within the range —0.4 < a < —0.1 if the 
bottom quark mass lies between 4.4 and 5.0 GeV and the octet matrix elements are allowed 
to vary within their errors. This rather crude leading order prediction should be reasonable 
as long as the scale dependent singlet piece is not dominant, which it is not. 

In NRQCD C , given the color-octet 1 So dominance, we would expect a quenching of the 
polarization, since, as discussed in the case of hadro-production, the spin flipping hadronic 



transition involved in the matrix element obeys helicial democracy. Using the results of |4T 
we may write down the order a s NRQCD C prediction for a 

a = -0-39(0^(^1)) (Q 1Q) 

(0 1 W (3^ 1 ))+61(0 8 W ( 1 5o))' 

where we have kept the formally subleading 1 Sq contribution in the denominator because 
of its large coefficient. Since the singlet contribution to the polarization is now leading 
order, we need to be concerned about the scale dependence discussed above. Indeed, a 
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NLO calculation, in the modified double expansion scheme, is in order. If, as in the case 
of the polarized rate, the octet dominates, then we would expect only a slight longitudinal 
polarization. Note that the NRQCD C polarization prediction has the advantage that it only 
depends on one unknown matrix element, so once the NLO calculation has been done, the 
prediction will be comparatively robust. 



VII. CONCLUSION 

There are several relevant questions to ask. Is there any reason to believe that there is any 
effective theory to correctly describe the J / i\P. We believe that the spin symmetry predictions 
for the ratio of \ decays clearly answers this question in the affirmative. Assuming that 
such an effective theory exists, then is it NRQCD C or NRQCD^? As we have shown the two 
theories do indeed make quite disparate predictions, which in principle should be easy to 
test. However, these tests can be clouded by the issues of factorization and the convergence 
of the perturbative expansion. 

One would be justified to worry about the breakdown of factorization in hadro-production 
at small transverse momentum. Indeed, in NRQCD C where the time scale for quarkonia 
formation is assumed to be the same as the time scale for the hadronization of the remnants, 
it seems quite likely that there could be order one corrections to factorization.^ Thus any 
support, or lack thereof, for the theory coming from these processes should be taken with 
a grain of salt. On the other hand, for large transverse momentum one would expect 
factorization to hold, with non-factorizable corrections suppressed by powers of m c /pT- 

As far as the perturbative expansion is concerned, it seems that for most calculations 



the next-to-leading order results are indeed smaller than the leading order result [31,42,43 



However, the one NNLO calculation performed, in the leptonic decay width ||44|| , is not 



well behaved at this order, which is worrisome. But, to truly test the convergence of the 
expansion we should take ratios of rates in order to eliminate the renormalon ambiguities 



45] . When this is done, it could very well be that the perturbative expansion is well behaved. 
Until another rate is calculated at NNLO we will have to be comforted by the fact that such 
cancellations have been seen to occur explicitly in other heavy quark decays j46 . 



With that said, let us gather the evidence in support of NRQCD C as being the proper 
theory for the J /if). If one is willing to accept that the extraction of the octet matrix 
elements from CDFj^] then the fact that the ratio is large for charmonia but seems to 
be small for bottomonia is rather compelling. If once the statistics in the bottomonia 
sector improve we find that there really is no hierarchy, then we believe that this would 
be strong evidence for our hypothesis. The fact that the J /if) radiative decay spectrum is 



10 This may be true as well in B decays. However, since most of the time the J/tp will be going 
out back to back with the remnants, one might expect factorization to be more accurate. 

11 This extraction is not free of factorization issues since the fit of the matrix elements involves 
use of data at rather small values of transverse momentum. However, if a cut at pt = 5 GeV is 
made, then the change in the fit is minimal. 
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monotonically decreasing while one sees a bump at larger energy in bottomonia also seems 
to lend credence to our hypothesis. However, the true litmus will come from the polarization 
measurements at large p?- The predictions of nearly 100% polarization in NRQCD 6 is quite 
robust. Whereas, in NRQCD C the polarization is diluted from Ml transitions of the O s ( 3 Si) 
operator. Unfortunately, the introduction of another unknown matrix element diminishes 
our predictive power. However, this is not to say that we can not rule out NRQCD C . Indeed, 
NRQCD C also predicts a leveling off of the polarization with positive a. So a measurement 
of longitudinal (or, for J /if), zero) polarization would indeed negate our hypothesis. 

We would like to close with a caveat. In particular, it should be pointed out that NRQCD C 
does not become exact in any limit. Typically, in an effective field theory, we expect that 
the ratio of sub-leading to leading contributions should vanish in a given limit of QCD. This 
gives us confidence that the theory MUST be correct in asymptotia. Whether or not the 
real world leads to a well behaved expansion though, is another question. For NRQCD C we 
might hope that as we take the limit Aq CD /m — > 0, we necessarily get the correct answer. 
However, in this limit the soft modes become perturbative and the power counting changes. 
That is, in this limit the state becomes Coulombic and NRQCD& becomes the correct theory. 
It may well be the case that in some observables the NRQCD C expansion is well behaved and 
in others it is not. Given that the expansion parameter is around 1/3, it seems reasonable 
to be confident in those predictions for which the corrections are suppressed by at least 
AQ CD /m 2 (modulo the convergence of the perturbative expansion), as are the predictions 
discussed in this paper. 
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